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CORRELATION OF SOME PSYCHOLOGICAL AND 
EDUCATIONAL MEASUREMENTS 



PROBLEMS 

"The results of all good experimental work will live, but as 
yet most of them are like hieroglyphics awaiting their decipher- 
ing Rosetta Stone." These are the words of Spearman. Such 
words are true of all fields of research, but they are worse than 
true of the field of Correlational Psychology. The Rosetta Stone 
of Correlational Psychology must do more than interpret ; it must 
reconcile. For this nothing less than a Philosopher's Stone will 
suffice, and Science, succeeding Black Magic, fully realizes that 
such a stone will not be found, but must be formed by a slow 
and laborious process. It is the hope that this study will con- 
tribute its small part to the making. 

Correlational Psychology is in this more or less chaotic condi- 
tion, not only because of poor experimental technique and diverse 
and inadequate statistical methods, but also because of the very 
great complexity, importance, and number of the problems which 
it has elected to attack. Such complexity, importance, and num- 
ber of problems is revealed by a very brief survey of the litera- 
ture on correlation. But not to go farther afield, it is excellently 
illustrated by the problems which it is the purpose of this re- 
search to examine. These problems follow: 

1. What are the intercorrelations among our psychological and 
educational tests or the functions which they measure ? 

2. What is the relative value of each test as a measure of 
mental ability ? 

3. In the practical measurement of mental ability for educa- 
tional and vocational purposes which tests are the more valuable ? 
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2 Correlation of Psychological and Educational Measurements 

4. In the construction and in the application of psychological 
tests for the measurement of mental ability, do 'speed* tests or 
'power 1 tests offer more promise, whether as to correlation, con- 
venience, or time spent ? 

5. What characteristics in a test make for high correlation with 
mental ability ? 

6. What is the value of improvement as a measure of mental 
ability ? 

7. What is the significance of chronological age as an intel- 
lectual index? 

8. Is there such a thing as a negative correlation between de- 
sirable traits? Is the law of human nature correlation or 
compensation ? 

9. Do our results support Spearman's "Theorem of the Uni- 
versal Unity of Intellective Function," or Burt's "Hierarchy of 
the Specific Intelligences"? 

These problems have been attacked experimentally. The fol- 
lowing pages describe the experiment, the use made of the data, 
and the results obtained. This experiment was devised originally 
to study problems other than those considered here. In fact, 
this study was not even conceived until the experiment was com- 
pleted. While this means a certain roughness of technique, it 
has the advantage of guaranteeing the impartiality of the data. 
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EXPERIMENTAL MATERIAL AND METHOD 

I. Subjects 

The subjects for this experiment were eighty-eight public 
school children of an average age of about twelve and one-half 
years and about equally divided as to sex. These eighty-eight 
children were two typical 6B classes in a typical elementary school 
in New York City. The two class rooms adjoined and the teach- 
ers who had charge of the children used the departmental method 
of instruction. That is, the two teachers divided the subjects to 
be taught equally between them and each taught her allotted sub- 
jects to both classes. In this way both classes received exactly 
the same instruction. The classes were equal in mental ability 
as measured by what is later described as the six preliminary 
tests, though the last fact is not essential to this study. Further, 
it should be noted that while children were at the beginning 
shifted from one room to the other in order to make the classes « 
equal in ability, in no case were children specially brought in 
from other classes. The eighty-eight children who made up the 
two classes were the children the experimenter found there when 
he began the experiment — they were typical classes. 

2. Tests with Their Administration and Scoring 

The general plan of the experiment was to give six preliminary ■ 
tests, to follow these with an extended practice series, and to con- 
clude with six final tests which were to be similar to, but not }■ 
identical with, the six preliminary ones. Certain special tests 
were given along with the practice series without interrupting it. 

In the administration of the tests every effort was made to 
treat both classes exactly alike. This was all the easier because 
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a test in one room was followed immediately by the same test in 
the other room. Written instructions were used at the beginning 
of each new test to avoid unconscious variation. During the 
practice series each class was tested for about half an hour. The 
testing began in one room half an hour after lunch and was con- 
cluded in the other room half an hour before the children were 
dismissed. The beginning class on one day would be the conclud- 
ing class on the following day. A teacher was always present 
when the children were being tested, though she took no part in 
the administration of the tests. The entire experiment was con- 
ducted by the author with the exception of the six preliminary and 
six final tests. Each of these sets was given to both classes in 
one day. This required an assistant, but even here the writer 
started every test and left the assistant to collect the papers. 

This experiment was throughout a group experiment, there 
being no individual testing. The detailed method for the practice 
series was as follows : The experimenter entered the class room 
~v and announced the names of the three pupils making the highest 
scores in each of the tests on the previous day. In addition to 
the regular procedure, if a new test were beginning, instructions 
were read and what was to be done was illustrated. Otherwise, 
the monitors distributed material face down. At the signal: 
Hands Up! all raised their hands. At the signal: Go! all began 
the test. At the signal: Stop! all ceased immediately, wrote 
their names and identification numbers on the sheets and turned 
them over to the monitors, who did the collecting. This was 
repeated for the other tests of that day, after which the experi- 
menter went through a similar procedure with the other class. 
The tests used on any one day during the practice series, the 
number of days they were used, the dates they were used, together 
with the average score made by both classes in each test are all 
shown in Table A. A brief description of the tests employed, 
the time allowed for each, and the method of scoring are given 
below. 

Preliminary and Final Tests 

Visual Vocabulary: The children were given the Thorndike 
Reading Scale A, which contains forty-three words. The first 
five words are easy and equally difficult. Each succeeding group 
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of five words grows progressively more difficult. The last group, 
consisting of only three words, is the most difficult of all. Thus 
both the lower and upper limits of the ability of the children were 
measured. The children were to write the letter F under every 
word that meant a flower, and the letter A under every word that 
meant an animal, and so on. In this as in all the preliminary 
and final tests the time allowance was thirty minutes. If a child 
completed a test, leaving nothing undone, before the expiration 
of the half-hour, he could hand his paper to the experimenter. 
This last rule held not only for all the preliminary and final 
tests but also for the special tests which were sprinkled along 
during the practice experiment. The Visual Vocabulary was 
scored in terms of penalties : 

Score = Errors + Omissions. 

The final Visual Vocabulary Test was similar to, though not 
identical with, the one just described. The two tests were ad- 
ministered and scored in exactly the same way. 

Reading: Thorndike's Reading Scale Alpha was used. This 
scale contains four paragraphs, each one being more difficult to 
comprehend than the preceding. Each paragraph was followed 
by several questions. The child's written answers to these ques- 
tions were taken as a measure of his comprehension of the para- 
graph. A complete sentence was not required of the child, one 
word sometimes being sufficient to express the idea. Time al- 
lowed : 30 minutes. 

Score = 2 (correct answers) -f- I (semi-correct answers). 

The final Reading Test is similar. I, J, K and L of Thorndike's 
longer Reading Scale were used. The scoring was identical. 

Completion: The Trabue Completion Test, consisting of 
twenty-eight mutilated sentences, was used. The difficulty of 
completing the first sentence is small, but there is a gradual in- 
crease in difficulty with each succeeding one. The child was to 
write in the missing word or words. Time allowed : 30 minutes. 

Score =: 2 (sentences completed correctly) -|- 1 (sentences 
completed semi-correctly). 

A similar set of twenty-eight sentences was employed in the 
same way for the final test. 

Arithmetic: Six problems in arithmetic, which grew progres- 
sively more difficult, were selected for this test. The child 
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handed in his work with his answers, but only the answers which 
were correct received a score. 

Score = Number of problems correctly solved, 

Six similar problems were used for the final test. 

Omnibus I A: The Omnibus Test is so called because it rep- 
resents a compilation by Professor Thorndike of several tests 
which psychology has found valuable. These are Easy and Hard 
Opposites, Verb-Object, Supraordinate, Mixed Relation, Easy 
and Hard Direction, and Addition. Time allowed: Thirty min- 
utes. The method of scoring this as all the other Omnibus Tests 
varied with each special part, hence it would be tedious to give it. 
The method used was that devised by Professor Thorndike. 
Anyone who desires to use these tests is referred, for a copy 
of the method of scoring, to the Department of Educational 
Psychology, Teachers College. 

The Final Test was Omnibus I B which includes the same 
tests as the one just described, the only difference being a slight 
variation of the tasks. 

Omnibus II A: This tested reasoning ability, the ability to give 
the opposites to certain hard words, the ability to give a verb to 
a specified subject and to add the proper letters to unfinished 
words, and the ability to solve certain problems in arithmetic. 
Time allowed : Thirty minutes. 

Omnibus II B or the Final Test is a slight variation of Omni- 
bus II A. 

Special Tests 

Proverb: The Proverb Test was recently devised by Professor 
H. A. Ruger. It consists of thirteen English proverbs followed 
by their corresponding African proverbs. In some the similarity 
is easy to perceive; in others it is more difficult. The children 
were to match the proverbs. Time allowed : Fifteen minutes. 

Score = Number correctly matched. 

Other special tests were given from time to time but since these 
tests were not given twice they have not been used in this study. 
It is necessary that there be two measures of a function if a corre- 
lation is to be corrected for attenuation. The Ruger Proverb 
Test has been retained just because it was recently devised. 

Age: Because of its possible significance, the age of reaching 
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the grade has been used as a measure of the children. This age 
measure was taken from the official school record, and is ex- 
pressed in months. 

School Mark: This measure was an average of all the marks 
given by the two teachers to each child in each subject taught 
during the semester in which this experiment was being carried 
on. No previous marks have been used. 

Teacher Rank: The two teachers were each asked to rank the 
eighty-eight children for mental ability. These ratings were 
made independently, although it must be remembered that the 
teachers had often talked together concerning the children. 

Practice Tests 

Cancellation of s's: For this the Woodworth-Wells Cancella- 
tion Sheet was used. This sheet contains a series of groups of 
five figures arranged in random order. The children were di- 
rected to cancel the figure 2. Time allowed : One minute. 

Score = 2 (number cancelled correctly) — 2 (number omit- 
ted) — 3 (number wrongly marked). 

Cancellation of 3's: Exactly the same test as the above, except 
that the children cancelled the figure 3. 

Cancellation of A's: On the Cancellation A Sheet fifty capital 
letter A's were arranged at random among other letters of the 
alphabet of which there were fifty each. The children cancelled 
the letter A. The time allowed and the scoring were as in the 
Cancelling 2 Test. 

Cancellation of S's: In every respect the same as the preceding 
test except that the letter S was cancelled. 

Addition: The Addition sheet employed by Thorndike, Kirby, 
and others was used in this test. It is made up of columns of 
ten one-place numbers arranged in random order, no figure less 
than 2 being used. The children were to write the sum of each 
column of figures. Four similar sheets were rotated to prevent 
memorizing. Time allowed : Ten minutes. 

Score = Number of columns added correctly. 

Copying Addresses: This test was recently devised by Profes- 
sor Thorndike. A sheet containing twenty-five names arranged 
in alphabetical order was given to each child, together with the 
small directory from which the names were taken. The children 
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found in the directory the New York City address and wrote it 
beside the appropriate name. A different list of names was 
used each day. Time allowed: Ten minutes. 

Score = Number of addresses correctly copied. 

Handwriting: Similar paragraphs were cut from the Youth's 
Companion and pasted on cards. Each child was given a para- 
graph and a sheet of blank paper with directions to copy as much 
of the paragraph as he could while writing as well as he could. 
This test was given twice each day, a new paragraph being used 
each time. It need hardly be said that in this test as well as 
the others all the children did exactly the same thing in any one 
test. Time allowed : Four minutes for each test. 

Score as i (number of lines or fraction of lines copied) minus 
i-io (each omission or error). 

Each omission or error counted as one (i). 

Any word or words omitted were of course deducted from 
the gross number of lines covered to get the figure which was 
substituted in the first parenthesis above. 

Miscellaneous Arithmetic: The children worked for twenty 
minutes each day in Thorndike's booklet "Exercises in Arith- 
metic No. 5." Since this test has never been accurately scored 
it was of little value for this study, consequently no further men- 
tion will be made of it. 
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STATISTICAL TREATMENT OF RESULTS 
I. Raw and Corrected Arrays 

The net original scores from the tests used in this study are 
given in the Appendix. In order that a coefficient of correlation 
might be calculated from these original data, it was necessary 
to reduce to one figure the many measures obtained from a prac- 
tice test. No such reduction was necessary for the data obtained 
from the preliminary, final, and special tests, because each of 
these was given but once. Further, in order to get a true coeffic- 
ient of correlation two measures of every function were necessary 
for each individual tested. This was simple in the case of the 
preliminary tests. The score made by each child in the pre- 
liminary test which was given February 3 was paired with the 
score made by the same child in the corresponding final test 
given April 28. The ability rank given by one teacher was 
paired with the rank of that same child given by the other teacher. 
School marks made in arithmetic, geography, and spelling were 
totaled and paired with the total of marks made in grammar, 
composition, and reading. Omnibus I A and Omnibus I B, being 
so much alike, were combined and paired with the sum of Om- 
nibus II A and Omnibus II E. Of the other special measures — 
Ruger Proverb and the Age of Reaching the Grade — no second 
measure was available. In the case of the practice tests the 
scores made by any one child on days 1, 3, 5, etc., were added 
and averaged. With this was paired the number obtained from 
summing and averaging the scores made by that same child on 
days 2, 4, 6, etc. The practice test — Cancellation of S's — was 
given an odd number of days, so day I was omitted as being 
the one most likely to be unreliable. 
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Statistical Treatment of Results II 

An 'array' is simply a column of figures to be correlated 
with some other column which permits of pairing by individuals. 
These arrays may be measures of the same function or of differ- 
ent functions. The preceding paragraph describes the method 
used in constructing what may be called the 'raw arrays.' Ob- 
viously, many factors may enter to make it impracticable or 
impossible to calculate a coefficient of correlation from such ar- 
rays. In the case of a practice test, for example, an individual 
might be absent on the last few odd days. This would probably 
make the first member of the pair smaller than the second. Or, 
again, one or more individuals might be absent on a day when a 
preliminary, final, or special test was given. Since each of these 
tests was given but once, obviously the absent individuals would 
have no score at all in that function. Since it was desired that 
every test be correlated with every other test, the raw arrays 
were examined, and whenever any individual was found who 
lacked a score for any preliminary, final, or special test, that 
individual was entirely eliminated from this study. Whenever, 
in the case of the practice tests, any individual had been absent 
more than two odd days or two even days, that individual was 
also eliminated. The absences just mentioned refer, of course, 
to those days on which the particular test under consideration 
was given. Any other absence standard might have been em- 
ployed. The more-than-two-days-absent standard seemed to be 
the one which would give the maximum accuracy of the scores 
with the maximum number of subjects. 

But the pairing in arrays was still more refined in the prac- 
tice tests. We may take Addition as an example of all of these. 
Suppose an individual were absent two days out of the ten odd 
days while he was present the ten even days. An average from 
the remaining eight odd days would be unduly decreased or in- 
creased as compared to the corresponding average from the ten 
days, according to whether the two absences were near the be- 
ginning or near the end of the practice. In order to overcome 
this difficulty, at least in part, the two scores which that individual 
would probably have made were padded in. Table A offers a 
. means for determining this probability for any day in the prac- 
tice. From Table A was calculated the average per cent of 
each day's increase or decrease with respect to the preceding day. 
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Using this per cent, the score which would probably have been 
made on the day when the individual was absent, was calculated 
from the last score made before or the first score made after the 
absence. Table B gives the raw arrays for all the tests used for 
the entire eighty-eight subjects. By eliminating the individuals 
who were absent on single-test days and also those who were 
absent more than two odd or two even days for any one practice 
test, the eighty-eight subjects were reduced to sixty-three sub- 
jects. When the two or less absent days were filled in with the 
probable scores, Table C resulted. Let us call Table C the 
'corrected arrays.' 

In closing this discussion one further remark is necessary. The 
original intention was to use more special tests than are shown 
in Table B. While these were dropped later, they figured in 
the elimination of pupils. Still another fact must be noticed. The 
teachers, who gave their opinion of the children's mental ability, 
ranked them in order from one to eighty-eight. When many 
individuals were eliminated gaps occurred in their ranks. It was 
decided to close up these gaps and make the range from one to 
sixty-three. 
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22.017] 



177. 0[8] 283. 018] 588.018] 549.0 [6] 

39.7 40.8 135.0 141.0 

151.0(5] 204.0C6] 272.014] B51.015] 



3SS.0[S] 

4ie.o[»] 

439.0 [9] 



Kim 

42.0[7j 



78. (I [0] 

79.0m 



784. 019] 
326.018] 


233.019] 
JM2.M8] 


89 8 


M0.0I7] 


104.9 


740.017] 
660.015] 


B62.0[7J 


594. me] 


mom 


57.2 


69.3 


s;.o 


SS.3 




10s. 5 


48.0 


51.9 


UT.8 


133.S 


145. 3 


149.3 


5M.019] 


44S.0I8] 


470.010] 


494.018] 


S64.0IB] 


602.019] 


ee.o[2] 


[0] 


170.012] 


170.012] 


275.013] 
692.017] 


204.012) 


29.9 


30.9 


rsj 


86.3 


108.5 


405.01.9] 


4S.3 


720.0J7] 


IS' 2 


S22.0LV] 


129.3 
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TABLE B (continued) 

Cancelling A Cancelling S 



Copying Addresses 



"1J! 



117.8*] 



sos.o[i] au.W5] 



HfcMn 

8B.0L6] 



I32.0[3] 

14S.OL2] 



17S.013J S0O.OC31 



182.0 [9] 
MB.0[« 
13.1 

1«.U[9] 

siii.o[9j 
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278.0 [6 J 184.0 [3] 



M.0[B] 

y/.oim 

11.0W] 
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TABLE I 
Handwriting 



{continued) 
Vim*] 

VocabuUrj 
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Teacher Hank Age io month* 
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TABLE B (.continued) 



TeicbetKtnk Age in month* 
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TABLE C 

Corrected Arrays : Scores or average scores made by 63 children in 

the tests shown at the top of the column. Under the practice tests : 

Column 1 — average score from odd days ; column 2 = average score 

from even days. The r— ' r J — : - -*- -' - L - — " — ■ 

G = girl. 



; number of days is shown at the top. 



Addition Cancelling;! Canceling! 



test 1 testa 



teat* test* 



71.9 96.8 105.5 



132.0 136.5 133.3 135.8 



37.3 31.8 105.3 
41.it 49.8 10O 



1D2.9 133.0 121.0 



103.8 117.fi 124.8 



M.3 113.3 113.0 



89.1 H.8 115.0 



13.0 11.9 127.3 



1M.0 150.8 



148 3 110.9 



90.8 103.3 lOS.g 115.0 



128.3 121.3 13B.I 



113.7 10S.9 



110.8 130.5 138.8 
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TABLE C (continued) 

Copyiug 
Cancelling A Cancelling S Addresses 



19.8 18.3 6.4 6.4 



as 


47,8 


53.0 


CO.O 


33.5 


24.9 


7.3 7.. 


47.! 


30.3 


43.3 


46.0 


16.8 


18.2 


7.6 7. 


30.9 


34.1 


33 


65.5 


14.7 


18.6 


3.0 6. 


»T 


58.0 


40.0 


36.0 


13.5 


15.3 


3.7 1.8 




77.3 


78.0 


77.0 


31.0 


33.0 


7.6 7. 


39.4 


48.0 


51.0 


55.3 




185 


6.3 5. 


51.9 


50.3 


40.5 


50.1 


2M 


38.2 


5.0 6.6 


ro.i 


70.0 


81.0 




35.8 


38.4 


9.4 9.3 


58.0 


88.) 


75. S 


so! s 






6.3 3. 


88.1 


87.7 


86.3 


83.0 


174 


lfcl 


4.7 4. 


S7.I 


58.1 


85.3 


70.8 


19.0 


20.1 


7.1 68 


13.7 


88.8 


59.0 


02.5 


333 


2,5.2 


0.3 3.3 


49.4 


51.0 


00. 


5!. 5 


17.4 


10.4 


6.5 3.4 


46.6 


46.6 




77.5 


sa.i 


22.8 


6.8 3.4 


u.o 


58.0 




77.5 


13.S 




8,0 0,8 




SI. 4 


53.0 


w 


17.4 


\SJ 


6.4 6.0 


70.6 


74.7 


03.5 




38.3 


30.8 


8.3 0.0 


59.3 


38.1 


75.5 


75.5 


20.5 


22.4 


5.9 8.0 


60.8 


38.3 




«4 •■ 


38.9 


39.3 


7.9 8.0 


43.5 


44 8 




sa.o 


16.4 


185 






88.7 


73.3 


74 5 


28.6 




7.1 7.: 


397 


37.7 


88.3 


84.0 


25.6 


25.7 


4.9 6.4 


3811 


tO. 2 


80 5 


82.5 


17.2 




5,0 5.9 


49.4 


52.4 


71.5 


686 


23.1 


23.8 


8.4 8.7 




• i • 


.15.0 


59 5 


20.8 


33.1 


7.0 7.4 


80.3 


90.9 


32.0 


87.0 


31.1 


21.7 


7.0 fl.8 


41.7 


38.8 


52.0 


•r : 


IS. 8 


10. 


4.9 4.0 


417 


49.0 


77.5 




18.5 




6.4 3.1 


S0.3 


80.6 


43.3 


51.6 


iso 


18.8 


7.5 7. 


U.O 


80.0 


Si. 3 


61.5 


29.7 


28.1 


7.9 6.3 




8S.4 


70.8 


86.0 


58.3 


37.3 


8.6 8.1 


48.4 


55,1 


39.3 


62.9 


21.3 


23.3 


6.7 6.7 




33.1 


43.0 


44.0 


14.7 


14.4 


6.4 5.5 


sis 


73.0 




55.5 




23.1 


7.3 7.6 


34.9 


44.9 




19.5 


16.2 




6.3 6.4 






























39.7 


44.9 


72.0 


76.0 


181 


IS. 8 


7.1 0.7 


43.3 


43.7 


48.6 


83.0 


10.1 


19.1 


8.8 6.7 


489 


85.4 


86.0 


69.5 


14.4 


15.7 


5.3 5.1 


ES.O 


80.3 


89.8 


76.0 


23.5 


24.4 


7.3 7.8 


44.(1 


aa.o 


88.8 


83.5 


17.3 


10.2 


6.4 8.8 


38.1 




88.0 




150 




7,1 7.3 


43.7 


48.0 


80S 


6LS 


18.5 


19.9 


7.9 8.3 


















58.7 


68.0 


88.5 


21.3 






81.7 


53.7 


53.D 


600 


20.5 


13.8 


7.1 0.9 


637 


56.3 


73.11 


73.7 




15 9 


7.1 7. 


43.1 


48.9 








18.8 


0.1 8.9 


73.3 


87.3 


8fc3 




18^ 


20.5 


8.1 8. 


80.8 


55 7 


84. S 


70.0 


39.8 


20.9 


9 4 8.9 


















54.4 


62.5 


81.0 


21.0 


23.0 


8.1 7.9 


88.9 


73.3 


44.5 


86.1 


n.o 


34.7 


7.4 7.5 
















83.8 


86.0 


62.5 


7S.5 


10.9 


22.8 


8.3 6.5 
















48.1 


81.9 






17.5 




6.2 6.5 










13.6 


15^3 


S.S 8.9 
















43.1 




88.9 


81.7 


22.2 


23.0 


8.4 8.4 
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Av. =10.1 21. • 80. 
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TABLE C (continued) 

■ A»in 

• M'othi Tcb'r Rink School Mark 



180.0 108.1 



138.5 108. 1 



70.0 103.0 



132.0 110.5 



108.5 142.5 
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2. Deviations and Their Combination 

The next step in calculating the coefficients of correlation was 
to turn all the scores in any one column of Table C into plus 
and minus deviations from the average shown at the foot of that 
column. These deviations are given in Table D. At the foot 
of each column is the square root of the sum of the deviations 
squared, which we shall find to be useful later. Further it will 
be remembered that Visual Vocabulary and the Omnibus tests 
were scored in terms of penalties, and what amounts to the same 
thing, a small measure by Teacher Rank means large excellence. 
To make these tests comparable to the others all their plus de- 
viations were changed to minus and all their minus deviations to 
plus. 

The reader will notice that two new tests appear in this devia- 
tion table. For reasons to be considered later it was found 
desirable to combine Visual Vocabulary with Completion. Col- 
umn 1 of this new measure is the algebraic sum by individuals of 
the deviations of Visual Vocabulary (1) and Completion (2); 
Column 2 is the sum of Visual Vocabulary (2) and Completion 
(1). The second of these tests or measures is a Composite. 
Column I of this Composite is an algebraic total by individuals 
of all the column i's of all the tests shown in Table E. Column 2 
of the Composite is the same thing for all the column 2's. But 
contrary to the Visual Vocabulary and Completion combination, 
not all the tests in Table E received equal weight. The weight 
actually given to each half of each test is shown under "Weight 
given," * in Table E. These weights were guesses, guided by what 
experimental evidence was then available, as to the relative value 
of each test as a measure of mental ability. Now the desired 
weighting was obtained by multiplying or dividing the deviations 
in any one column by the figure under "Multiple" in Table E. 
These figures were those which, when divided or multiplied into 
the square root of the sum of the deviations squared divided by 
ten, changed these square roots to the relative sizes shown under 
"Weight given" in Table E. In psychological literature such a 
Composite is usually taken as a measure of general mental ability. 
1 This weight was given before our own coefficients were calculated. 
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Deviations from thf. Average op Each Tbst 



92 — 1.7 



"V/SDcv.. 



— 18.0 — 8.0 



- 7.8 -10.7 -10. 



— <0.0 — 7.0 



- 7.J - 1.8 



- 8.0 - 9.1 — 10.S 
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CancellinsS Copying Addresaci Handw! 



hj — to 



8.6 — 8.7 



- 0.7 — ft.* 



4.3 - 3.11 - 1. 



- 0.7 - 0.8 

- 5.4 — 4.1 — l.a 

3.7 4.1 0.0 

- 1.1 — 8.7 — 1.8 



- 0.9 - 0.4 

- 1.4 - 1.8 

- 0.2 — 0.S 
0.8 — 0.1 



S.7 — 0.4 
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TABLE D (continued) 

lad. Vocabulary Completion Arithmetic 

1 > 13 1 t 

J4.1 + L« M -114 - U 1.4 

a - 8.K -J- 1.1 -1 - 0.4 1.1 1.4 



"V S Der.. 



— 1.1 — 0. 



- 1.1 — 1.9 



— 14 — 1 
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— S.7 — 1*. 



*:; 



-o —*. 



+J3.B 

pi 

+ 2.8 

+ 1S.1 



M —0.1 



-29> 


-20.S 


+ 1J.3 


— e.fl 




+2S.1 


—28.2 


+ 0.8 


+ 5.3 


-N.!> 


— 4L2 


— 3.9 


+M.3 


+30.8 


— 33.7 


-12.4 


+11.S 




-13.7 


—11* 


-12.7 


+ 1.8 


— 8.2 


+ 1« 


+33.8 


+2SI 






— 7.2 


+ 11 


-a;. 3 


-10.S 




114 


+118 


— l.B 


+33. s 


+:o!s 


+ 54.8 






- ».* — 1.3 



« 



11 
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TABLE D {continued) 

ad. School Hark Compoiite Visual Vocabulary 

-^Completion 



—HI — MS 



- (.* -MJ 

M —71.1 

11" 64.8 

-•.4 -W.5 
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TABLE E 
Weights Given to Each Test in Evolving a Composite Measure 
Multiple = the number by which the deviations of the tests to the 
left were multiplied or divided to secure the desired weighting. 





Divided by 


ir ' 




Multiple 




Given 


Addition 


t 


1673 






4 


= 


+2 




2 


17-38 






4 




4-3 


Cancelling 2 




15-90 






5 




3-3 




2 


1570 






5 




3-1 


Cancelling 3 


1 


15.00 






5 


= 


3.0 




2 


1472 






S 




3 


Cancelling A 


1 


9-17 






5 


= 




2 


9-96 






5 




2.0 


Cancelling S 


1 


9-99 






5 


= 


2.0 






976 






5 




2.0 


Copying Addresses 


1 


3.10 


X 


1 


= 


3.1 




2 


3-39 


X 


1 




34 


Visual Vocabulary 




3-98 


X 






40 




2 


439 


X 


1 




4-4 


Completion 




4.29 


X 


3 




12.9 




a 


409 


X 


3 




12.3 


Arithmetic 


1 


113 


X 


8 




9-0 






0.86 


X 


8 




6.9 


Reading 




2.87 


X 


2 




57 




2 


4-95 


X 


2 




,1:2 


Omnibus 




18.00 


X 








2 


i6.45 


X 






■6.5 


Teacher Rank 




14.43 




2 




7^ 


School Mark 


2 


14-43 
8.72 


X 


2 


= 


11 




2 


8.03 


X 






&o 


* This figure has no spccia 


•ignjfi 


■nee. 










3. Calculation of 


Raw 


COEFFICIEN 


TS 


of Correlation 



A coefficient of correlation is a numerical statement of the 
proportionality between two series of measures. If the excel- 
lence of the scores made by a number of individuals in one test 
is exactly proportional to the excellence attained by the same in- 
dividuals in another test, the correlation is positive and perfect. 
Using r as an abbreviation for correlation: r = -J-i. If the pro- 
portionality is exactly inverse, r = — 1. If there is no tendency 
to proportionality at. all, r = o. If there is a tendency to pro- 
portionality r is cither a positive or negative decimal according 
to the direction of the tendency. 

The standard method * has been used in calculating all the co- 
efficients of correlation. This method is expressed by the Pearson 
formula : 

■The Bravais-Galton- Pearson method. 
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Referring to Table D the method of calculating the r for, say, 
Addition (i) and Cancelling 2 (1) was, vis.: The deviations in 
the Addition (1) column were considered x's while the devia- 
tions in the other column were y's. The numerator of the for- 
mula was obtained by getting an algebraic sum of the products of 
every x multiplied by its corresponding y. The figures at the foot 
of the two columns being correlated were the denominators of 
the formula. Given these, r was easily calculated. By employ- 
ing this method the first measure of every test was correlated with 
its second measure; some measure of every test was correlated 
with some measure of every other test ; in certain instances, every 
column of a few tests was correlated with every other column 
of certain other tests. These first coefficients are called raw 
coefficients. 

4. Calculation of Corrected Coefficients of Correlation 

Thanks to the excellent work of Spearman, we now know 
that these raw coefficients are not true representations of the pro- 
portionality between measures or functions. He discovered that 
chance inaccuracies in the original scores did not balance them- 
selves out but that they always tended to reduce the correlation 
toward zero. 1 The correlation was said to be "attenuated." The 
next step in this study was to correct the raw coefficients for at- 
tenuation. There was used for this purpose Spearman's for- 
mula: 



v(fpiqi) ( r PiQt) ( r PtBi) (ffafla) 

^(fpiPl) (fort,) 

where, if A and B are the facts to be related, p is a series of exact 
measures of A, q is a related series of exact measures of B. 
r pq is the coefficient of correlation of A and B, obtainable from 
the two series /• and q, thus being the true coefficient. p 1 and p 2 
are two independent series of measures of A. </, and q 2 are two 
independent series of measures of B. r P , gi is the correlation 
when the first measure of A and the first measure of B are 
used. r Pt q 2 is the correlation when the first measure of A and 
the second measure of B are used and so on for the remaining 
i assumption see Brown, The Essen- 
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symbols. It is now clear why two measures for each individual 
in every test were necessary. Without two measures the raw 
coefficient is the best measure obtainable. 

The raw intercorrelations among all the tests (except the prac- 
tice tests) for which there were double measures, were calculated 
for every column with every other column in that group. This 
group also included the Composite. These raw coefficients sup- 
plied all the necessary data for calculating the true coefficients 
from the Spearman formula. Now the practice tests gave much 
more reliable measures for each individual; hence, whenever a 
practice test was being correlated with any other test just enough 
coefficients were calculated to satisfy the shorter correction for- 
mula: 

_ jKfftft) ( r Wa) 

r M = 

V(r (lft ) (r flias ) 

By the use of either of these two formulas the corrected coefficient 
or the true correlation was found for every test or function 
which was measured twice. The Age of Reaching the Grade, 
while really one measure, was treated as though split exactly in 
two, r qi q z in the shorter formula thus being considered as -f- 1. 
This left only one test uncorrected. Table F gives the corrected 
coefficients or the true correlations between the tests and the 
functions which they measured. A gap in the table means that 
the true coefficient is substantially zero. The correction at that 
place was impossible either because one of the raw coefficients 
turned out zero or because one was a small positive and the other 
a small negative. In either of these cases the correction for- 
mula fails to work. 

The shorter correction formula above is the same as the 
longer formula except that two symbols have been omitted from 
the numerator. Theoretically, it would have been better to have 
retained the omitted and omitted the retained symbols, but, prac- 
tically, the difference in correction is insignificant. The longer 
formula is to be preferred but the time required often makes its 
use prohibitive. 
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5. Reliability Coefficients 

The significance of the corrected r's shown in Table F is de- 
pendent on their reliability. This reliability is in turn dependent 
on the number of subjects used and the amount of correction 
that has been applied. The "reliability coefficient" or the raw r 
for two separate measures of any one test indicates the amount 
of this correction. The corrected r for two tests whose reliability 
coefficients are exceedingly small is of doubtful value. Some 
of the factors * which make for high reliability coefficients are : 
that the function tested be narrow ; that the time spent in test- 
ing be long; that the test material and experimental technique 
for the two tests be identical ; and that there be no large variation 
in the condition of the subjects. The reliability coefficient for 
every test having a double measure is shown in the table of raw 
coefficients further on in this book, but for convenience they 
are summarized below. 



Reliability Coefficients, together with the Total Time Spent on the 
Test or Tests Composing either One or the Two Correlated 

Measures 



Addition, 100 minutes (10 tests) 

Cancelling 2, 8 minutes (8 tests) 

Cancelling 3, 8 minutes (8 tests) 

Cancelling A, 7 minutes (7 tests) 

Cancelling S, 4 minutes (4 tests) 

Copying Addresses, 100 minutes (10 tests) 

Handwriting, 40 minutes (10 tests) 

Visual Vocabulary, 30 or less minutes (1 test).. 

Completion, 30 or less minutes (1 test) 

Arithmetic, 30 or less minutes (1 test) 

Reading, 30 or less minutes (1 test) 

Omnibus, 60 or less minutes (2 tests) 

School Mark, 1 
Teacher Rank.. 
Composite 



The very, very high reliability of the tests from Addition 
through Handwriting is due chiefly to the narrowness of the 
functions tested, the similarity of the test material and also, in 
the case of Copying Addresses and Addition, to the relatively 
large amount of time spent on the tests. Intercorrelation among 
these tests scarcely needed correction. The reliability of Arith- 
1 These factors do not grow out of our data. 
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metic and Reading is unsatisfactory ; that of Visual Vocabulary 
and Completion leaves something to be desired ; all the rest are 
satisfactory. The coefficient for Teacher Bank is surprisingly 
large, due probably to the close cooperation of the two teachers 
in teaching the same children. So, with regard to reliability, the 
only corrected coefficients* which need to be closely scrutinized 
are those with Arithmetic and Reading. 

We have spoken of the reliability of the tests as dependent 
on the amount of the correction. It is important to know the 
reliability of any particular coefficient derived from these tests. 
This is dependent on the number of cases or the number of 
individuals. P. E. is the measure of this reliability according to 
the formula: 

p _ .6745 (■ = ^ 
\» 

where r = actual coefficient of correlation and 

n = number of cases included. If the number of cases were 
infinite the reliability would be absolute. We have always used 
sixty-three cases, hence 

.67 (i— r ! ) 
P. E. = , 

Using this formula we get : 

Probable Error of the Coefficients of Correlation 

r P. E. 

.1 .08 

a .03 

.3 .08 
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CONSIDERATION OF PROBLEMS AND COMPARISON 
OF RESULTS WITH THOSE OF OTHER EX- 
PERIMENTERS 

1. What Are the Intercorrelations among Some Recent 

Educational and Vocational Measurements and Cer- 
tain Traditional Tests? 

The first problem which this study set out to attack has now 
been solved. The corrected coefficients given in Table F are 
the answer. Since these correlations will be considered in con- 
nection with other problems, a detailed discussion at this place 
would be tedious. In interpreting the corrected r's the reader 
should remember one fact in addition to the cautions given 
in the preceding chapter. Handwriting was scored by amount 
copied and no attention was given to the quality of the penman- 
ship. A large score in this test might mean that the quality of 
the writing had been sacrificed. On the other hand, it might 
be contended, from a study of the penmanship of men of great 
ability, that increased speed and decreased quality both correlate 
very highly with mental power. With no evidence to offer, the 
author prefers to leave the matter to the opinion of the reader. 

2. What Is the Order of Each Test's Correlation with 

Mental Ability? 

Before this problem can be solved we must have some measure 
of mental ability. This study proposes three different standards 
by which to measure each test. 

The first standard includes all the available measures which 
are outside our psychological tests. The ideal standard would 
be one which properly weighted all the activities in the life of an 
individual. A complete standard would take into account not 
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only how well one does in a psychological test but also what kind 
of grade is made in school, what kind of opinion the teachers 
have, how well the games of ball are played, the papers sold, the 
errands run, etc. Of all these things there are, outside the 
psychological tests, just two measures available: Teacher Rank 
and School Mark. The value of these two measures as one of 
our standards consists in the fact that they represent an at- 
tempted weighting of numerous activities, and that they are 
measures free from any preconceived opinions of this study. The 
corrected r's in Table F for Teacher Rank and School Mark 
have been averaged for each test, and the positive size of this 
average has been taken as that test's correlation with mental 
ability. 

The second standard used is the correlation of each test with 
the Composite. The Composite combines the standard just de- 
scribed with the psychological tests. Possibly the Composite 
gives too much weight to the Cancellation tests but, in view of the 
later discussions of this book, it is perhaps wiser to err in this 
direction. All considered, the writer believes this to be the best 
measure of mental ability available for this study. 

The third standard by which to determine the value of a test 
as a measure of mental ability is the average of that test's corre- 
lations with all the other tests. But immediately we get into a 
difficulty, a difficulty which was minimized in connection with 
the use of the Composite as a standard. A glance at Table F 
will show that there are at least two distinct groups of tests which 
oppose each other: the Cancellation group and the group rep- 
resented by the Complex tests. In evolving the Composite meas- 
ure, this difficulty was surmounted by arbitrarily giving a rela- 
tively small weight to the Cancellation tests. But with the third 
standard where equal weight is given to each correlation the Can- 
cellation group will exert an important influence. Obviously, it 
would not be fair to give as much weight to five Cancellation 
tests as to five other separate tests, especially when the Cancella- 
tion group measures such a narrow function. If there were just 
one such test the matter would not be so serious. If the Can- 
cellation tests are good measures of mental ability then the 
Complex tests are not. In this dilemma our first standard proves 
its worth. Teacher Rank and School Mark, admitted by all 
experimenters to have considerable value as measures of men- 
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tal ability, vote against the Cancellation group. Further, com- 
mon sense shows that the other group measures a wider range 
of abilities. Moreover, any one test in the Complex group shows 
a wider range of positive correlation. Consequently, no test will 
be used for the third standard that does not show a distinct 
positive correlation with the first standard. This eliminates Age, 
Handwriting, and the Cancellation tests. 

Using these three standards the order of each test's correla- 
tion with mental ability is shown in Table H. 



Order of Correlation of Each Test with Mental Ability by Stand- 
ards 1, 2 and 3 and by an Average of thk Three. (Data from 

Table F) 



Omnibus -75 

Completion .73 

Teacher Rank 

School Mark 

Reading .68 

Arithmetic .62 

Visual Vocabulary .44 

Copying Addresses .34 

Addition .23 

Handwriting .02 

Cancelling A .00 

Cancelling S —.10 

Cancelling 3 —.28 

Cancelling 3 — J8 

Age —.50 —.26 —.19 

In studying Table H it is important that the reader remember 
that a coefficient of correlation from arrays of averages is not 
necessarily the same thing as an average of several coefficients 
of correlation. An example of the former are the coefficients 
in the column under Composite, while an example of the latter 
are the coefficients in the other three columns. But our problem 
is not now to discover the absolute coefficient of correlation be- 
tween any one test and mental ability ; it is to rank the tests 
relatively, i.e., which test correlates most closely, which second, 
which third, etc. Each of the three standards should give sub- 
stantially the same ranking to each test. In fact, the agreement 
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is remarkable. The average of the ranking by the three standards 
is practically the same as the ranking by any one of the standards. 
This average can be taken as the answer to our problem. 

3. How Close Is the Correlation of Each Test with 
Mental Ability? 

The answer to the above problem depends upon which standard 
is accepted as the best measure of mental ability. Omnibus cor- 
relates .75 with Standard 1, 1.00 with Standard 2, and .66 with 
Standard 3. Which is the truest coefficient? To trust to an 
average of the three, as was done in section 2, would merely 
serve to conceal glaring differences. The Composite is better 
than Standard 1 because it includes Standard 1 along with many 
other valuable measures. Standard 3 or the correlation of each 
test with all others gives an equal weight to all the measures 
composing it, but all three standards agree that all the tests do 
not equally measure mental ability. The Composite gives a 
weighting which is, at least, roughly correct. Strictly speaking, 
the correlation of a test with all other tests taken separately is 
a measure of a test's correlational spread rather than an absolute 
measure of its closeness of correlation with all these separate 
abilities considered together. So far as the question under con- 
sideration goes, Standard 3 assumes that, disregarding chance 
errors in measurement, any one test is as good a measure of 
mental ability as any other and that any one test is as good as 
all averaged together. The Composite, on the other hand, con- 
siders a sum of properly weighted abilities a better measure of 
mental ability than any one of them taken separately. For 
these reasons this study considers the Composite the best avail- 
able measure for determining the absolute correlation between 
any one test and mental ability. 

Since we are hopelessly immersed in theory, we may as well 
consider the most important objection likely to be offered to the 
Composite. It might be said that the Composite causes a test 
to show a spuriously high correlation with mental ability because 
it is composed of the tests which are to be correlated with it. On 
the contrary it might be argued that to eliminate Completion, 
say, from the Composite before cowelating it with the Composite 
would unfairly reduce the correlation, for mental ability means 
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the ability to do Completion as well as the ability to do the 
thousand and one other things which enter into complete living. 
To strike a true balance between these two contentions would be 
difficult 1 if not impossible, consequently the Composite has been 
retained in its original form. 

Using, then, the Composite as a standard, the closeness of the 
correlation of each test with mental ability is shown in column 
2 of Table H. This column reveals five interesting facts : 

a. Omnibus and Completion correlate perfectly with mental 
ability. To be exact, Completion correlates +.96. 

b. Seven of the tests correlate closely with mental ability. 

c. The Cancellation tests give a negative correlation with men- 
tal ability. 

d. The Age of Reaching the Grade also correlates negatively 
with mental ability. 

e. The coefficients for the tests which measure power are in 
every case larger than the coefficients for the tests which measure 
speed. 

4. What Is the Practical Significance of These Facts 

for Educational and Vocational Diagnosis and 

Guidance? 

Before considering each of the above facts in the light of the 
problem just stated it is interesting to consider another question: 
just what is the need for measuring mental ability ? The pseudo- 
philosopher derives his greatest pleasure from discoursing upon 
the negative correlation which exists between the academic and 
the real world. In one respect at least this antagonism no longer 
exists. The most persistent demand that has come to the psychol- 
ogist in the last few years has been, that he develop a means for 
measuring that most elusive yet pre-eminently valuable thing 
which we call mental ability. And this call comes from school 
and factory alike. 

The school wants to adjust its training to the individual differ- 
ences of the pupils. How can it measure these differences, is the 
question asked of the psychologist. The principal wishes to class- 
ify a group of children by ability. How measure the ability? 
The junior high school wishes to put in one group the supernormal 
1 There is a statistical method by which the amount of spurious cor- 
relation can be determined. 
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pupils, iit another group the normal, and in another the subnor- 
mal. How be certain the pupil is not wrongly placed? Educators 
realize that some pupils simply haven't the ability to deal with 
mental elements, abstract symbols and the like. Which pupils? 
A class for mentally defective children is being formed. Who 
should be in the class? A college in the West is planning to select 
its Freshman class on the basis of mental tests. Are the tests 
valid measures of mental ability? Experimenters everywhere 
wish to form groups of equal ability. By what standard shall 
they be called equal? Sociologists wish to discover if unemploy* 
ment is the result of mental defectiveness. How gauge the men- 
tality? Makers of mental tests desire a standard by which to 
measure their .own product. What standard is reliable? The 
youthful yet virile science of vocational guidance wants to pre- 
vent or diminish the present fearful misdirection of energy. 
Business is little less clamorous, but no more need be said to 
show the very great importance of discovering excellent measures 
of general ability as well as tests for special powers. Now let 
us return to the significance of the facts reported in the last 
section. The first of these was : 

(a) The Omnibus and Completion Tests Correlate Perfectly 
with Mental Ability 

The problem of measuring every single activity of an individual 
in order to determine his general mental ability, is, of course, 
impossible of solution. So psychology has been trying to find 
a few measures which epitomize all possible measures. So far 
as the writer is informed, the test which has received the most 
favorable mention in this connection has been the Ebbinghaus 
Mutilated Text. The Completion Test, mentioned above, is a 
development by Dr. Trabue of Ebbinghaus' idea. This study 
finds ample justification for the high favor accredited the Eb- 
binghaus Test and it congratulates Dr. Trabue upon a modification 
of it which is likely to prove still more valuable. If we remem- 
ber that mental ability means mental ability as measured by our 
Composite, the Completion Test correlates with it -|- .96. The 
correlation is not exactly perfect but it is very nearly so. 

This study is equally pleased to congratulate Dr. Thorndike 
upon having compiled and in part devised the Omnibus Test 
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which correlates + 1.00 with our Composite. The Completion 
Test was given for thirty minutes, the Omnibus for sixty minutes. 
Does this correlation of -+- 1.00 mean that a test has at last been 
devised which gives a perfect measure of an intellect by one hour 
of testing? It must not be forgotten that the -(- 1.00 is a cor- 
rected coefficient. Were the 1.00 a raw coefficient and were the 
Composite adequate the above question could be given an affirma- 
tive answer. The corrected coefficient -|- 1 .00 means that were 
an individual measured enough times with the Omnibus Test to 
he certain of an accurate score, then that individual would have as 
perfect a measure as if he had been given all the tests compos- 
ing the Composite. How many times and how long each time a 
person would have to be tested in order to give a perfect * meas- 
ure of him in any one function is for a future research to deter- 
mine. But granting the Composite is not an adequate measure 
of mental ability and granting the correction is a little too large, 
the fact remains that the Completion Test and Omnibus Test are 
very excellent ones. But because of the multiplicity of mental 
functions and the variability of their, performances it is wise 
to give several types of tests and possibly to secure several 
measures for each type. This brings us to the second significant 
fact mentioned a few pages back : 

(b) Seven of the Tests Correlate Closely with Mental Ability 

Since it is wiser to trust to several tests than to one or two, 
those interested in educational and vocational diagnosis, guid- 
ance, and classification as well as vocational selection will want 
advice as to what tests this study would recommend. Of the 
fourteen measures used, we consider the following to be the 
best and most reliable indices of intellect : Omnibus, Completion, 
Visual Vocabulary, Teacher Rank, School Mark, Reading and 
Arithmetic. The first five tests are the best. An average from 
them will give a good measure of an individual's ability, and that 
with the expenditure of just two hours in actual testing. The 
difficulty of the purely psychological tests could be varied to 
suit the ability of the group being tested. It ought not be long ' 
until other tests are devised which can be added to this small 
group. It is not too much to hope that the near future will 
1 This term is used loosely, for psychology is far from agreement as 
to what constitutes a perfect measure. 
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find psychologists able to measure general mental ability very 
accurately for a group of any si2e after one day of testing. 
Until that time comes we now have tests which will measure in- 
tellect roughly at least. And for many purposes such a rough 
measure will suffice. 

To the five measures recommended in the preceding paragraph 
three criticisms suggest themselves. In the first place, Teacher 
Rank and School Mark are not always available. Or in cases 
where they are available, it is often impossible to use them because 
Teacher Rank is not an absolute measurement and because School 
Mark varies in meaning even within one school. In the second 
place, the psychological tests recommended, measure, primarily, 
abstract ability — the ability to handle ideas and symbols rather 
than to deal with "things and their mechanisms." All that we 
know * about the relation between Idea Thinkers and Thing 
Thinkers indicates that the man who is good at manipulating 
ideas is potentially good in manipulating things. If the mechani- 
cal skill desired requires special training this criticism is more 
serious. The third criticism is that such tests as these can only 
be given to literate people. This is true but it is a fault which 
our schools are repairing every day. These three criticisms 
merely limit the usefulness of these measures and they emphasize 
the fact that even psychological testing requires the exercise of 



Another result of this study which may prove of practical 
value is: 

(c) The Age of Reaching the Grade Correlates Negatively with 
Mental Ability 

Probably every text-book on the psychology of individual dif- 
ferences mentions maturity as an important factor in producing 
differences in mental ability. But no educational administrator 
now believes that mental age always coincides with chronological 
age. If he does so believe, he does not dare use it as the sole 
basis for the classification of the school children. A very com- 
mon complaint among young teachers is that their chronological 
age weighs heavier than their mental age with school superin- 
tendents. Besides these immediately practical significances, the 
1 We greatly need tests of mechanical ability to experimentally test 
this statement 
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influence of age is of keen concern to almost everyone who is 
engaged in educational or psychological research. Correlational 
psychology, for example, is in constant fear lest its insidious 
influence operate to produce spurious correlation. To be brief, 
no one would object to this statement : below the age where 
senility begins, the tendency is for the older individuals to be 
the more able. In so far as the two sixth grades studied here 
are typical of all grades, we find an exactly opposite tendency, 
which may be summarized, vis.: in any one class the tendency is 
for the more mature to be the less able. This is no rank heresy 
nor is it an unpredictable mystery. If a pupil is overage for his 
group it probably means that he has been retarded, and this in 
turn probably means that he started life with an intellectual 
capacity which could be expressed as a minus deviation from the 
average. So the influence of maturity is not a simple one, or to 
speak more exactly, age is no sure criterion of mental ability. 
The meaning of age is dependent upon the group in question. 
The scope of the negative correlation found in this study needs 
to be tested by experiments upon other grades and other groups. 
Even more important is the next fact growing out of this re- 
search: 

(d) The Cancellation Tests Show a Negative Correlation with 
Mental Ability 

We say above that Cancellation correlates negatively with the 
Composite. The 2eros after Cancelling A and Cancelling S 
(Table H) mean that in those two cases the correction formulas 
could not be applied. In addition to the evidence of Table H 
the trustworthiness of the negative correlation is further certified 
to by the fact that the Cancellation tests correlated negatively with 
each of the seven tests which have shown themselves to be good 
measures of mental ability. The coefficients are small but dis- 
tinct. 

It is beyond the scope and data of this research to consider 
why, so far as psychology is concerned, there has been such 
a chasm between laboratory and life. We suggest that possibly 
we have here, in the negative correlation of Cancellation with the 
Composite, one element of a complete explanation. The Can- 
cellation Test is a not unfair sample of what traditional psychology 
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has been employing in its laboratories. In order that positively 
interpreted results from such a psychological test correspond 
to results from practical experience, what is would have to coin- 
cide with what tends not to be. But a problem of such magnitude 
cannot be settled by the relatively meagre data of this study. 

The point of main interest for us is that the Cancellation tests 
are now in very common use. A Cancellation sheet ts about the 
first one that enters a newly established laboratory. One college 
is trying them out, along with others, as a partial entrance test. 
If other researches substantiate this one and experimenters con- 
tinue to use it, the test must be interpreted negatively. But even 
here the correlation is so low the test is just about valueless for 
any positive purposes. 

(e) The Correlations with Mental Ability of the Tests which 
Measure Accuracy and Speed Are Smaller than the Similar 
Correlations of the Tests which Measure Accuracy, Speed, 
and Power 

Psychological and educational tests are readily divisible into 
two main groups : tests which measure accuracy and speed and 
those which measure accuracy, speed, and power. The factors, 
accuracy, speed, power, are really elements of every psychological 
test, hence our division may seem to the reader somewhat arbi- 
trary. The division into two groups is due not so much to differ- 
ences of elements as to differences of emphasis. The emphasis 
in the first group is upon accuracy and speed so let us call the 
tests classified there, 'speed tests.' In the second group the em- 
phasis is upon accuracy and power, so let us call these tests, 
'power tests.' 

As stated before, speed tests measure accuracy and speed 
primarily. They are usually simple in form and easily within 
the ability of the group being tested. Further, all parts of the 
test are about equally difficult. The chief characteristic of this 
type of test is that its units seldom approach in difficulty to the 
maximal ability of the group being tested. The instructions 
accompanying these tests, are to work as rapidly as possible with- 
out making errors. Our own Addition is an excellent example 
of a speed test. Courtis's Arithmetic as usually given is another 
example, though with sufficient time his tests could be used in 
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such a way as to make them power tests. Practically all the tests 
employed by the older, traditional psychology, such tests for 
example as 'Reaction Time,' 'Cancellation,' etc., belong in this 
group. 

The power tests involve speed, to be sure, but the chief factors 
are accuracy and power. By 'power test' we mean one that 
contains units sufficiently difficult to discover the maximal ability 
of the person or persons being measured. A power test is usually 
of a more complex nature than a speed test. The first part is so 
easy as to be within the ability of the stupidest member of the 
group being measured, while the remaining parts of the test 
grow progressively more difficult until the maximal ability of- 
the brightest individual is measured. Our Trabue Completion 
is an excellent example of this type. The Binet Test belongs 
in this group also. Mr. Clifford Woody is engaged in making 
arithmetic tests * of the same nature. In fact most of the recent 
educational and psychological tests could be classified here. 

Of the tests used in this study, Cancellation, Handwriting, Ad- 
dition, and Copying Addresses are speed tests, while Visual 
Vocabulary, Completion, Reading, Arithmetic, and Omnibus are 
power tests. We have called the Omnibus a power test not 
because it is of the same nature as Completion but because it is 
complex, because some of its units grow progressively more 
difficult, and especially because all the units of the test hover 
close to the maximal ability of the group tested. 

For the practical purpose of measuring mental ability which 
tests offer more promise, those of the speed type or the power 
type? The first evidence we have to offer is shown in column 2 
of Table I. The coefficients in that column do not recommend 
the speed tests. Of the five different kinds of tests used, Copying 
Addresses proves itself the best as a measure of mental ability. 
But even it is always surpassed in correlation by what we have 
termed the 'power tests.' Of course, this comparison, which 
has resulted unfavorably for the speed tests, refers only to the 
tests used in this research. Copying Addresses, however, prob- 
ably ranks considerably above the average speed test in its corre- 
lation with mental ability. At least it probably occupies as 

1 "Measurements of Some Achievements in Arithmetic," Clifford 
Woody, Teachers College, Columbia University, Contributions to Educa- 
tion, No. 8o. 
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favorable a position with respect to the speed tests as does, say, 
Visual Vocabulary with respect to the power tests. In so far as 
this is the case, the scope of our comparison extends to tests not 
employed in this study. 

It is interesting to enquire into the causes for this difference 
in correlation between the speed and power tests. We believe 
that the emphasis upon power, not as opposed to but as superior 
to speed, is one significant element. Much more experimentation 
would be required to establish this view, but so far as they go 
our results harmonize with such an assumption. Another sig- 
nificant element seems to be the complexity of the function tested. 
On the whole the power tests do measure more complex functions. 
The Omnibus is preeminent in complexity and in correlation with 
mental ability. The Cancellation tests are preeminent as to the 
narrowness of function they measure and they are last in their 
correlation with mental ability. The tests in Table I are ar- 
ranged in the order of their correlation with mental ability. An 
order for complexity, so far as we can judge complexity by 
external appearance, would seem to correspond very closely to 
this arrangement by correlation. It is a matter for congratulation 
that the more recent mental and educational tests are embodying 
these elements of complexity and power. It is a pity the simple 
speed tests are not as valuable as the complex power tests, for 
they are easier to score. Furthermore, the complex power tests 
are not readily usable in long time practice experiments. By 
increasing the complexity of the speed tests we may yet make 
them valuable measures of mental ability. 

In our comparison thus far we have considered only cor- 
rected coefficients. The practical measurer of mental ability 
must. base his conclusions upon raw scores and not upon scares 
derived from many more measurements. Hence a practical com- 
parison of speed and power tests must be made with raw as well 
as corrected coefficients. Table I gives the raw coefficients not 
only of each test with every other test, but, what concerns us most, 
the raw coefficients of each test with the Composite. Since each 
test has two or more coefficients with every other test, Table I 
is rather confusing, so for convenience, the reader is referred to 
Table J which is an average of the coefficients of each test with 
every other. 
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Table J permits a comparison of the closeness of raw correla- 
tion between each power test and the Composite with that between 
each speed test and the Composite. Consulting this table we dis- 
cover that Copying Addresses, which is the best of the speed 
tests, shows a correlation of -|- 49 with the Composite, while 
Omnibus shows a correlation of -|- .80. In every instance, except 
in the case of Arithmetic, Copying Addresses gives a lower corre- 
lation with mental ability than do the power tests. So the raw 
coefficients say as emphatically as the corrected coefficients that 
a better idea of mental ability can be gotten by measuring with 
Omnibus, Completion, Visual Vocabulary and the like than could 
be gotten by running a practice experiment with Copying Ad- 
dresses, Handwriting, Addition, or Cancellation. 

The comparison of the speed and power tests is not yet com- 
plete. The speed tests as used in this study make available two 
important measures: an average of all the daily scores and the 
amount of improvement shown by subtracting the first measure 
of a test from the last measure. In general, a power test pro- 
vides just one measure or else so few measures that improvability 
is too small to be of much use. Hence the power test has but 
one measure to balance the two obtainable from a practice test. 
It is conceivable that improvability with a speed test is a better 
intellectual index than a score from a power test. To discover 
if this be the case, the improvements made in the practice tests 
were correlated with the Composite. The improvement arrays 
were calculated in the following manner: the scores made on 
the first day by any one individual in Cancelling 2 and Cancelling 
3 were combined and subtracted from the sum of the scores made 
on next to the last day. In order to get a reliability measure and 
to correct for attenuation, a second measure was calculated for 
each individual by subtracting the combined scores made on the 
second day from the combined scores of the last day. By a similar 
procedure a double measure was calculated for Cancelling A, for 
Addition, and for Copying Addresses. The absence of any in- 
dividual on any one of the four critical days was corrected for 
as in Chapter III, Sec. 1. The improvement thus calculated was 
correlated with the Composite by the method described in the 
early part of this book, the only difference being that in correct- 
ing for attenuation the other half of Spearman's formula was 
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used. The raw and corrected Pearson coefficients are given i 
Table K. 



f Improvement with Mental Abtuty (Composite) 
Raw Coefficients 



Cancellation 2 + 3 <i) with (2) (Reliability) .83 

Cancellation 2 + 3 (1) with Composite (2) 26 

Cancellation 2 + 3 (2) with Composite ( I ) 13 

Cancellation A (1) with (2) (Reliability) .41 

Cancellation A (1) with Composite (2) —.09 

Cancellation A (2) with Composite (1) .07 

Addition (1) with (2) (Reliability) .80 

Addition (1) with Composite (2) 

Addition (2) with Composite (1) 

Copying Addresses (1) with (2) (Reliability) 

Copying Addresses (1) with Composite (2) 

Copying Addresses (2) with Composite (1) 

Average Row Coefficients 

Cancellation 2 + 3 with Composite .20 

Cancellation A with Composite — .01 

Addition with Composite ,26 

Copying Addresses with Composite .05 

Corrected Coefficients 

Cancellation 2 + 3 with Composite zi 

Cancellation A with Composite 

Addition with Composite .26" 

Copying Addresses with Composite 

If we compare the average raw coefficients of correlation in 
Table K with the column under Composite in Table J we see 
that improvement in the practice tests was, if anything, an even 
poorer measure of mental ability than was an average of all 
the scores. By the use of averages Copying Addresses did show 
a substantial correlation with the Composite, whereas by the use 
of an improvement measure, its correlation dropped almost to 
zero. 

In considering the practical value of tests, other factors than 
those discussed should receive at least a passing mention. These 
are ease of administration and scoring and the amount of time 
required. Further it is just as important to ask what is the dis- 
tribution of the time given to the test as it is to ask how much 
time is actually spent in testing. Thirty minutes of testing con- 
centrated in one period, for example, is usually more convenient 
than fifteen minutes distributed over three days. 
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Of all psychological tests the Binet is the best known and the 
most perfectly standardized ; yet for general use it will probably 
be supplanted by tests which require less skill and less time to 
apply. The problem of extending the sphere of psychological 
and educational measurement is very largely that of substituting 
group for individual testing. The speed tests and power tests 
used in this study are all well adapted for group measurement. 
They do not materially differ in ease of administration, nor is 
there a very great difference in ease of scoring. There is a differ- 
ence, however, and this difference favors the speed tests. The 
speed and power tests can be compared for time and convenience 
by consulting Table G. This table considered in conjunction 
with Table J shows that one hundred minutes of Copying Ad- 
dresses when distributed over ten days gives a correlation of 
-{- .49 with the Composite. Omnibus with only sixty minutes of 
continuous testing gives a correlation of -|- .80 with the Com- 
posite. In every instance the time spent upon the power tests 
was considerably less than that spent upon Copying Addresses. 
To sum up the entire discussion, the power tests give a much 
higher correlation with mental ability than do the speed tests ; 
and this is true whether average score or improvement is used 
as the measure of the speed tests. Further, the power tests equal 
the speed tests in ease of administration, and they surpass 
them in time convenience. Ease of scoring, only, favors the speed 
tests, but this superiority is so slight as to be of small conse- 
quence. 

The issue thus far has been drawn, on the one hand, between 
those of our tests which are simple in nature, which measure a 
relatively narrow function, which are considerably below the 
upper limits of ability, which have units roughly equal and which 
were designed and are adapted to measure speed and accuracy ; 
and, on the other hand, those tests which are relatively complex, 
which measure a wider range of functions, which hover close 
to the upper limits of ability or else begin easy and grow pro- 
gressively more difficult Thus far we have considered the com- 
parative excellence of these two main groups of tests as measures 
of mental ability. We can further draw the issue not between 
the two types of tests but between the two methods of adminis- 
tering any of them. It has been claimed that the amount of 
improvement shown by a practice test is a better intellectual index 
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than are "snap-shots" with those tests. The snap-shot test meas- 
ures improvement from birth or conception, not to go back fur- 
ther, to the time in the life of the individual when the test is 
given. The practice test, on the other hand, measures, improve- 
ment from the first to the last trial at that particular test. This 
issue could be settled fairly only by comparing the coefficients 
gotten by correlating the score from the first trial with mental 
ability and by correlating improvement, found by practice at 
that same test, with mental ability. But here our troubles begin. 
Those complex, snap-shot tests which show a high correlation 
with mental ability cannot conveniently be used in a practice ex- 
periment. And since only those which we have called the speed 
tests can be readily used for practice purposes the issue is really 
the same as that between the speed tests and the power tests, the 
speed tests representing the improvement measure and the power 
tests representing the snap-shot score. The decision reached in 
the preceding discussion favored the power tests. 

It is possible, however, to view the speed tests, such as Addi- 
tion, Copying Addresses, etc., as snap-shot as well as practice tests, 
and thus secure a comparison of the two methods. The first 
trial of these tests has not been correlated with mental ability 
but improvement has, and the results are shown in Table K. 
If the average from all the trials may be considered as at least 
a partial representative of the first trial then the coefficients for 
the speed tests in Table J under the Composite reveal some inter- 
esting inconsistencies. Measured by an average, Copying Ad- 
dresses shows the closest correlation with mental ability of all the 
practice tests ; measured by improvement it shows about the 
least correlation. The average correlates a little closer than the 
improvement in the cases of Addition and Cancellation of A's, 
while improvement has a slight advantage in the case of Can- 
cellation of 2 and 3. However we may explain these apparent 
inconsistencies by differences of physiological limit, the fact re- 
mains that improvement in these tests is a very poor measure 
of mental ability, even poorer than an average, and probably no 
better than a first trial. In no case does it even approach a snap- 
shot score for a power test. 
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5. What Are Some Theoretical Considerations Growing 
Out of This Study? 

(a) Is there such a thing as a negative correlation between 
desirable functions ? Is the law of human nature correlation or 
compensation ? 

Rightly or wrongly Emerson is usually held responsible for a 
philosophic statement of the law of compensation. The law 
has been given a more scientific terminology by certain German 
psychologists, especially in connection with their attempt to clas- 
sify individuals into types. Stated in whatever form, the impli- 
cation is that there exists a negative correlation between desirable 
traits. From such a doctrine springs the idea that the higher 
the ability in dealing with abstract things, the lower it is in dealing 
with concrete things; that slow learners are long rememberers; 
that the person endowed with beauty is by the justice of Nature 
left devoid of brains; in short that Nature always balances a 
superiority with an inferiority. In the third volume of his "Edu- 
cational Psychology," Professor Thorndike vigorously assails 
this doctrine. "It should also be noted that in original nature 
the rule .is correlation, not compensation." Or again, "It is very, 
very hard to find any case of a negative correlation between de- 
sirable mental functions. Divergencies toward what we vaguely 
call better adaptation to the world in any respect seems to be posi- 
tively related to better adaptation in all or nearly all respects. 
And this seems especially true of the relations between original 
capacities." In the stand taken by Dr. Thorndike, the author 
heartily concurs. Hence it is with no small surprise that he finds 
himself compelled to appear as a defender of inverse correlation 
between desirable mental functions. The only way to avoid 
the necessity of advocating a theory so unpopular with recent 
psychology is to call the ability to cancel the figures 2 and 3 
or the letters A and S, an undesirable mental trait. The ability to 
perceive a thing, pick it out from other things, and do something 
with it seems so fundamental to all our mental life that we are 
scarcely justified in calling such an ability undesirable. Nor can 
we, without outraging the best of our common sense, call unde- 
sirable the abilities to do the Visual Vocabulary, Completion, 
Reading, Arithmetic, and Omnibus tests, or to make good marks 
in school and secure the teachers' esteem. And yet between the 
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Cancellation tests and this more complex group we find a negative 
correlation. 

If the reader will turn back to Table I and count the number 
of coefficients of correlation which have been calculated between 
the Cancellation group and the complex tests mentioned above, 
he will discover that there are 56 such coefficients. Of these 53 
are negative and only 3 are positive. Further, of these 3 not one 
coefficient is as large a positive as +.10 while there are negative 
coefficients of — .35, — .36, — .37 and — .39. The average of the 
3 positive r's is -f-.07. The average of the 53 negative r's is 
— .21 (P.E. .08). Some of the negative coefficients are small 
enough to be due to chance, but it is much easier to believe that 
the 3 positive ones are due to chance. In view of the size of the 
negative coefficients and the unanimity of results from all the 
tests we are forced to conclude that the inverse correlation is 
genuine. Nor is this genuineness unsupported by previous ex- 
perimenters. Dr. Chapman. ('14), "Individual Differences in 
Ability and Improvement and Their Correlation," using the same 
Cancellation 2 and 3 tests upon twenty-two college students, found 
correlations between Cancellation and Mental Multiplication of a 
three-place by a three-place number as follows : .00, .03, .16, — .05, 
— .13, — .14. These coefficients will average a small negative. 

If future results substantiate our findings, what does it mean ? 
It means that a negative correlation can exist and that many 
more may exist than we at present suppose. There are those 
who believe that training in one mental function is transferred to 
another in proportion to the size of the positive correlation be- 
tween the two. If there be anything in such a belief, positive 
transfer accompanying a positive correlation may imply * a nega- 
tive transfer accompanying a negative correlation. Such a state 
of affairs existing would mean that to educate a person in one 
trait would be to uneducate him in all the traits correlating 
negatively with it. It is not impossible to conceive that some 
of the more or less trivial traits intensively developed by the 
schools correlate negatively with a hundred valuable abilities. 
The mere possibility argues for the future development of ex- 
perimental education. Our knowledge is very meagre. The 
wells which man has digged in the earth are far more numerous 
than the borings which psychology has made into the mental life. 
1 Such an implication is not necessarily true. 
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Though all these things be possible, we nevertheless believe 
with Dr. Thorndike that the law of human nature is correlation 
and not compensation. Although correlational psychology is a 
new science, it has several thousand coefficients to show for its 
labors. Never before, so far as the writer is informed, has a 
negative coefficient been so persistently in evidence. If inverse 
correlations were numerous, more should have made their ap- 
pa ranee by this time. Further, the negative correlations found 
in this research may not mean that the functions are intrinsically 
inverse. Had a sufficient reward been offered, it may be that 
the brighter pupils in the complex tests would have forged ahead 
in the Cancellation tests. In a simple test like Cancellation possi- 
bly the brighter children lost interest first. Quite conceivably, dif- 
ferent abilities have different interest and attention levels. Simple, 
routine, relatively easy tasks might be just right to interest the 
stupid, while they bored the abler individuals unutterably. Tasks 
difficult and complex enough to interest the abler individuals might 
be beyond the interest and attention of the stupid. A complete 
explanation of the cause would have to explain at the same time 
why the average from cancelling figures gave a negative correla- 
tion with the Composite while improvement at cancelling figures 
shows a slightly positive correlation with the Composite. 

(b) What bearing do our results have upon Spearman's Com- 
mon Factor? 

The reader will remember that just a few pages back we were 
so unwary as to become involved in a discussion of the cause 
for a negative correlation. Why mental functions correlate in 
any way, whether negatively or positively, is one of the most 
vital, most difficult, and most disputed problems with which corre- 
lational psychology has dealt. One step toward an explanation 
has been an attempt to determine the correlational grouping of 
mental traits. Here the question asked is : With respect to their 
intercor relations just how do the multitude of mental traits group 
themselves, into one system, two systems or many systems? 
Concerning this there are three different theories, the "multi- 
focal," the "intermediate," and the "unifocal." 

Spearman in an article entitled, "General Ability, Its Existence 
and Nature," published in Volume V of the British Journal of 
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Psychology, summarizes the "multifocal" theory, vis.: "Accord- 
ing to this view, ability in any performance depends upon a 
complex of elementary factors ; the correlation between two per- 
formances simply measures the degree in which the elementary 
factors demanded by the one happen to coincide with, or to be 
bound to, those demanded by the other. The elementary factors 
include both 'form' and 'content' ; by form is meant the kind of 
mental operation, as discrimination, observation, inference, etc. ; 
while the 'content' denotes the different sorts of data, as color, 
shape, number, etc., submitted to such operations." 

Between the "multifocal" and "unifocal" theories there are 
various intermediate ones which organize mental traits into a 
variety of "faculties," "centers," or "levels." Psychologists who 
classify the mental life into "types" or "faculties" imply that the 
multitude of functions composing any one "faculty" or "type" 
show a close correlation with one another while they show a loose 
correlation with traits which belong in a different "faculty," 
"type," or "center." Dr. Thorndike seems to believe in correla- 
tional "levels" when he writes: "Correlations seem to be closer 
within the analytical or abstracting functions than between these 
and others. So also within the purely mental associative functions 
like adding, completing words, giving opposites or naming objects, 
than between one of them and one of the sensori-motor functions. 
The sensivities seem to interrelate only loosely; and any one of 
them would relate very loosely to the associative or analytical 
functions, even when the latter was busied with data from that 
sense." 1 

The "unifocal" theory is represented by Dr. Spearman's famous 
"Common Factor." To quote from Spearman himself : "Here, 
the view supported is that all performances depend to a certain 
degree upon one and the same general common factor, provision- 
ally termed 'General Ability.' Correlations are thus produced 
between all sorts of performances, the amount of correlation 
being simply proportional to the extent that the performances 
concerned involve the use of this general common factor, or 'Gen- 
eral Ability.* " * This criterion proposes not as many centers as 
there are "elementary factors," not as many centers as there are 
"faculties" or "types," nor even as many centers as there are 
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"levels"; rather it proposes just one center. In the same article 
Spearman summarizes the importance of this question by saying: 
"This sharp divergence between the three current views appears 
to be of grave importance. It bars the way to all interpretation 
of our laboriously accumulated correlational data. It confuses 
all theory as to the intellectual 'make-up' of individuals. And it 
paralyzes our practical power of gauging the intelligence of per- 
sons, both normal and insane." Following this statement Spear- 
man proceeds to give his proof of the existence of the "Common 
Factor" and of the inadequacy of all previous conceptions. After 
many psychological considerations he decides that the "Common 
Factor" is "some common fund of energy." Finally he concludes 
with: 

"(l) At present, there exists such a great divergence of opinion 
about the correlation between different intellectual performances, 
as to impede gravely the progress of psychology. 

"(2) But closer consideration of all the actual data of the 
different authors shows that this divergence is merely due to 
gross misinterpretation. In reality, all the facts indicate unani- 
mously, that the correlation arises through all the performances, 
however different, depending partly on a General Common Fac- 
tor." 

Do our results support Spearman's contention and justify 
his conclusions? The first evidence we have to offer is the nega- 
tive correlation between the Cancellation group and the Complex 
tests. Correlation, according to Spearman, is produced by the 
General Common Factor and modified by the "specific abilities" 
of the traits correlated. To quote again : " . . . every intellectual 
performance may be regarded as proceeding from two distinct 
factors ; on the one hand, the specific ability or disposition for 
that particular performance; and on the other general ability, 
due to the common fund of intellective energy." What Spearman 
meant by "specific ability" may be gathered from these quotations : 
"An 'ear' for melody is known to be particularly specific, that is, 
independent of other elementary capacities." And again, "... 
their correlations (specific) do not occur in a pure state, but 
only superposed upon correlation of a more general character." 
The theory of the Common Factor seems to require that all 
coefficients of correlation be positive. How two functions can 
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share in a Common Factor and yet show a negative correlation 
we are unable to see. Perhaps the Cancellation traits are ostra- 
cized from the exclusive society of the Common Factor. Perhaps 
in the tug of war the "specific abilities," heading in a negative 
direction, outpulled the Common Factor. The proved skill of 
Dr. Spearman could doubtless defend his theory from such a 
trivial attack. 

In the article already referred to, Dr. Spearman proposes a 
remarkably ingenious and important method of treating correla- 
tional results. By this method he proved to his satisfaction the 
existence of a Common Factor, hence the fate of his theory de- 
pends upon the proper working of this method. We purpose to 
treat our results by exactly the same method to see whether they 
justify a belief in a General Common Factor. In his article 
Spearman gave a correlational table which had the general form 
of the one given below. (The coefficients are not the same.) 
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Concerning the table of coefficients which Spearman gave, he 
wrote : "The most obvious method would be to devise as criterion 
some direct function of all the coefficients in the table. We have, 
however, chosen a somewhat different course. It seemed desir- 
able to retain the power of noting whether the whole table 
obeyed the same law or different parts of it behaved differently. 
Also we were anxious to simplify the calculations as far as 
possible, in order to appeal to a wider circle of readers. For these 
reasons, our criterion was based upon singling out from the table 
any pair of columns of coefficients. . . . Our criterion consists 
simply in the correlation between one column of figures and the 
other ; it is the correlational coefficient between the two series of 
correlational coefficients ; clearly this is just as easy to work out 
as between any other two series of values. It should be noted 
that this correlation between columns is quite independent of the 
arrangement in which the table happens to have been drawn 
up." 

Also Spearman tells us that he threw away the two coefficients 
which had no corresponding coefficients in the other column. And 
then, a few pages further on, he says : "Such, then, is the statisti- 
cal method which we have devised for deciding between the three 
rival theories. If the older view of Thorndike, viz., a general 
independence of all correlations, holds good, our correlation be- 
tween columns of correlational coefficients should average about o. 
If his newer view of "levels" or the almost universal belief in 
"types" is correct, then the mean correlation between columns 
should be a low minus value. If, finally, the true theory is that 
of a General Common Factor, the correlation between columns 
should be positive and very high." 

Since Spearman's method has been applied to average raw 
coefficients it is highly desirable that the halves of a test from 
which the coefficients were derived measure substantially the 
same thing. Otherwise an average of the raw coefficients would 
be somewhat misleading. To this end, no test has been used 
which did not show a reliability coefficient of +.70. According 
to Table G this criterion eliminates Arithmetic and Reading. 
Visual Vocabulary and Completion were combined, thus raising 
their reliability coefficient to +.69, which was accepted as satis- 
factory. The intercorrelations of the accepted tests are given 
in Table L. It is upon this table that we purpose to test the 
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Spearman theorem. The reliability criterion was set up and the 
correlation table was constructed before it ever occurred to the 
writer to enquire whether it would operate favorably or un- 
favorably to the "Common Factor." 

Now, if Spearman's "unifocal" or "Common Factor" theory is 
to be corroborated, the correlation between any two columns of 
Table L should be, to use his own words, "positive and very 
high." To be exact, Spearman says the average of all the corre- 
lations should be positive and very high. But Spearman himself 
would be the first to say that unless all parts of the table sub- 
stantially agree, the use of an average would conceal rather than 
reveal the truth. He perceived this when he wrote : "It seemed 
desirable to retain the power of noting whether the whole table 
obeyed the same law or different parts of it behaved differently." 
It cannot be emphasized too strongly that, according to Spear- 
man's statistical method, the crucial thing, in the last analysis, 
is not the size of the average; it is the size of the correlation 
between any two columns taken from the correlational table. 
Bearing this in mind, is the correlation between any two columns 
of Table L "positive and very high," or does it tend even to be 
"positive and very high" ? Taking various pairs of perpendicular 
columns from Table L and correlating them we get such results 
as the following: 

Cancelling 2 with Visual Vocabulary ■(■ Completion — .95 

Cancelling 3 with Omnibus — .95 

Cancelling A with Teacher Rank —.83 

Cancelling S with Composite — .91 

Any one of the tests shown to the left paired with any one of 
(he tests at the right would give similar coefficients to the above. 
The results are just exactly opposite to what is required to satisfy 
Spearman's theory. Instead of the coefficients being "positive and 
very high" they are negative and very high. What then led 
Spearman to believe in a Common Factor ? The answer is given 
in the following: 

Cancelling 2 with Cancelling 3 -j-1.00 

Omnibus with Visual Vocabulary + Completion -)- ,00 

Many more such high positives could be given. Mere inspec- 
tion of Table L will show that the correlation between any two 
columns from Cancelling 2 through Cancelling S would give a high 
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positive. A high positive coefficient would also be gotten from 
any pair from Visual Vocabulary -|- Completion through Com- 
posite. On the other hand, the correlation of any column in the 
first group with any column in the second would be a high nega- 
tive. What would the average be? A mistake! 

Lest anyone should think that the coefficients from correlated 
columns always approximate unity, note the following smaller 
coefficients : 

Handwriting with School Mark — ,56 

Cancelling 2 with Copying Addresses .00 

Addition with Visual Vocabulary ■+ Completion +.51 

Between -f-S 1 an ^ — -5^ other intermediate coefficients could 
be given. By the proper selection of columns to be correlated, 
data could be found to support all of the three main theories, the 
"multifocal," the "faculty" or "type" or "level," and the "uni- 
focal." 

Objections will be urged against our correlational table (Table 
L). It could easily be said that Teacher Rank does not measure 
a mental trait at all, unless perhaps it be a mental trait of the 
teacher, and therefore such a measure should not be included in 
the table of correlations. It was retained because Dr. Spearman 
speaks of using "Imputed Intelligence" in his tables. But the 
omission of Teacher Rank would not change the general con- 
clusion. 

The only really important criticism would concern itself with 
the number of the Cancellation tests. Spearman would probably 
say that because of them our table is overloaded with "specific 
abilities." He himself combined two Cancellation tests which 
occurred in one of his tables, though he offered no justification 
for such a procedure, except that the tests were similar. If the 
tests were practically identical there could be no objection to 
his combining them. Likewise it would be difficult to protest 
had he elected to treat them separately, for they were not exactly 
the same test. If correlation be due to "specific ability" plus 
"Common Factor," we should not forget the work of Thorndike 
and Woodworth. They have shown experimentally that traits 
which seem almost identical may really not be so at all. If 
external similarity be our measure of "specific ability," the corre- 
lation between Cancelling A and Cancelling S would be higher 
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than between Visual Vocabulary -|- Completion and Omnibus. 
As a matter of fact, the correlation is — |— .57 in the first case and 
-J-.60 in the second. There is no more reason for combining these 
two Cancellation tests than for combining the Visual Vocabulary 
I ■ Completion and Omnibus. But supposing we yield the point 
and retain only Cancellation 2 and Cancellation A, then the re- 
maining columns can be correlated to give a result like this : 

Cancellation 2 with Omnibus — .94 

But to be still more generous, we have thrown out every Cancel- 
lation test except Cancelling 2 ; yet we can get a result like this : 

Cancelling 2 with Omnibus —.92 

In view of the foregoing we are forced to conclude that Spear- 
man's theory does not have universal validity. And we have 
proved this by the application of his own statistical method. Dr. 
Spearman certainly bases his theory upon numerous data col- 
lected from many sources. His averages certainly were positive 
and high, and he explicitly states that no individual correlation 
of column with column fell appreciably below positive unity. Had 
we correlated every column in Table L with every other column 
and had we taken an average of all these correlations, the mean 
result would have been a substantial positive. But in view of the 
differential action of different parts of the table, such a summa- 
tion would be not only misleading but wrong. 

Dr. Spearman after advancing and defending his theory of 
the Common Factor proceeds to state the nature of it. Concern- 
ing the former, Burt writes : "The first of Dr. Spearman's propo- 
sitions, the "Theorem of the Universal Unity of the Intellective 
Function' is tested by a corollary logically issuing from it, called 
that of the 'Hierarchy of the Specific Intelligences.' Its principle 
may be most briefly expressed as follows : 

r (A, P) _ r (A, g) 
r (B, P) r (B, Q) 

where A, B, P, Q, represent any four capacities not obviously 
akin. 1 When this formula is satisfied a correlational table can 
be so drawn up that the coefficients in horizontal columns grow 
1 British Journal of Psychology, VoL III, p. 159. 



^Google 



Consideration of Problems 6$ 

smaller to the right and those in perpendicular columns grow 
smaller downward. Burt's coefficients did substantially satisfy 
the above formula, and when thrown into the usual table they 
formed a beautiful 'hierarchy.' Consequently, Burt agreed with 
Spearman's first theorem. The 'Common Factor' and the 'Hier- 
archy of the Specific Intelligences' must stand or fall together. 
Just as our results do not corroborate Spearman's contention, 
neither can our coefficients be so arranged as to show a hierarchy. 
Burt, like Spearman, claims that the above formula only holds 
when the capacities are "not obviously akin." This is the crucial 
point. We are insisting that external similarity is not a satis- 
factory measure of kinship. But even when we yielded to ex- 
ternal similarity so far as to eliminate every Cancellation test 
except one, our results failed to substantiate Spearman's 'Com- 
mon Factor' or Burt's 'Hierarchy of the Specific Intelligences.' 
Complete fairness to Dr. Spearman makes another remark 
necessary. Spearman points out that what he calls "sampling 
errors" introduce a definite bias into the results obtained by cor- 
relating columns of coefficients, and that to determine the exact 
size of the coefficient this bias must be corrected for by a for- 
mula which he gives. In order that the correction may not be 
so great as to swamp the real difference, he sets up an arbitrary 
correctional standard by which he excludes those columns which 
have large sampling errors. Unfortunately, we have been unable 
to make clear to ourselves just how he applies this standard, 
hence our correlational table has been left unmodified. For this 
reason we do not correct our results by his formula but present 
them in their raw form. Anyway, the exact size of the coefficient 
is not necessary to test Spearman's theory. And even though 
Spearman finds that some column used by us did not quite 
satisfy his correctional standard, it is hardly conceivable that the 
sampling error could be so large as to completely reverse the 
direction of the coefficients upon which our conclusion is based. 
In correlating two columns from a correlational table, two 
coefficients must be thrown away, one from each column. This is 
necessary because there will always be one coefficient in each col- 
umn which lacks a corresponding coefficient in the other. But 
what is worse still is that every time a new pairing of columns 
is made different coefficients are eliminated. This increases enor- 
mously the labor of calculating the intercorrelation among the 
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columns, for with each new pairing a new average, a new set of 
deviations, and a new sum of deviations squared must be calcu- 
lated. In calculating the Pearson coefficients for ordinary arrays 
these things are done but once. To minimize labor, therefore, 
we suggest that the coefficient -|- 1.00 be inserted at every place 
in the correlation table where there is a gap. An array will, of 
course, always correlate -j-1.00 with itself. This coefficient is 
usually omitted in drawing up a correlational table because to 
insert it would not he particularly illuminating. Where, however, 
we wish to apply Spearman's statistical method such an inser- 
tion would prove exceedingly serviceable. We did not use the 
■ f-1.00 in calculating any of the coefficients used in our attempt 
to refute the two theories of Burt and Spearman. We believe 
that to fill up the gaps in a correlational table in this way is 
theoretically correct. In every case where we have tried correlat- 
ing columns with and without the -{-1.00 the coefficient has been 
very nearly the same. But even though the coefficients were not 
the same, the insertion of the +1.00 might still be justifiable. We 
merely mention it here in the hope that some one with sufficient 
training in the theory of correlation will test our suggestion. 
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CONCLUSION 

The mere wording of a question may stimulate thinking which 
will result in experimental research. It is our only excuse for 
asking so many questions and giving a final answer to so few. 
Certain conclusions grow out of this study, but the amount of 
data in any one research is necessarily so meagre that universal 
validity can scarcely be claimed for any of them. But in view 
of the limitations of the study, the following seem to us worth 
a place in a summary : 

i. The corrected correlations among the educational and psy- 
chological tests and the functions which they measure contin- 
uously vary in size from — .63 to +.98. 

2. Meaning by mental ability a Composite of all the .measure- 
ments, the Omnibus and Completion tests correlate with it +1.00 
and +.96, respectively. That is to say, a perfect measure of an 
individual by Omnibus or Completion would be a substantially 
true index of his mental ability. 

3. The seven best measures of mental ability together with 
their correlations with the Composite are: Omnibus 1.00, Com- 
pletion .96, School Mark .91, Teacher Rank .86, Reading .81, 
Visual Vocabulary .80, and Arithmetic .72. 

4. Ranked in the order of their correlation with mental ability 
the complex educational and vocational tests come first, the rela- 
tively complex practice tests second, and the simple practice tests 
last 

5. The power tests, or those which measured the upper 
threshold of ability, showed a higher correlation with mental 
ability than the speed tests or those which measured how rapidly 
a relatively easy task could be accurately performed. The power 
tests were superior not only as to correlation but also as to time 
required and the distribution of that time. 
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6. The indications are that for a test to show a close correla- 
tion with mental ability it should emphasize power rather than 
speed and test a relatively complex function rather than a narrow 
mental trait. 

7. Improvement at a speed, practice test was on the whole not 
so good an intellectual index as an average of the practice scores 
and not nearly so good an index as a single score from a complex, 
power test. 

8. In this particular 6 B school grade chronological age corre- 
lated negatively with mental ability. 

9. The Cancellation tests correlated negatively not only with 
the Composite but also with all those tests which proved to be 
good measures of mental ability. This demonstrates that a nega- 
tive correlation between apparently desirable traits can exist. 
Heretofore, the weight of scientific evidence has been against 
such a possibility. 

10. The correlation between columns of correlational coeffic- 
ients does not corroborate Spearman's important "Theorem of 
the Universal Unity of Intellective Function." 

\ 11. In no way can a correlation table be so constructed from 
our coefficients as to satisfy Burt's "Hierarchy of the Specific 
Intelligences." 

12. A suggestion was made whereby gaps in a table of co- 
efficients can be filled. This suggestion, if justifiable, will greatly 
economize labor in applying to a table of coefficients Spearman's 
statistical method of correlating columns of correlational co- 
efficients. 
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General Instructions for the Six Preliminary and Six Final Tests : 

I am going to give you several tests to find out how good a score 
you can make. Do your best in each test To-morrow I shall read the 
names of the two making the highest total scores. Notice carefully all 
instructions so you will not need to ask questions and thus disturb others. 

(Read before each series.) 



There will be placed before you, face down, a sheet of paper. This 
paper tells you what to do and how to do it. You will have 30 minutes 
in which to complete the test. When you have finished everything on the 
paper, bring it to me and return quietly to your seat. Don't look at 
your paper until I say "Go," and stop instantly when I say "Stop." Do 
what it says to do. 

(Read before each test.) (Proverb: 15 min.) 

Instructions for Cancellation : 

You will be given a cancellation sheet. In this sheet a certain specified 
number or letter must be cancelled. Omit as few cases and cancel as 
many as you can in one minute. The sheet will be placed before you 
bottom-side up. When I say "Go," turn the sheet over and commence to 
cancel- When I say "Stop," cease immediately. Your score will be as 
follows: 2 (number cancelled correctly) minus 2 (number omitted) minus 
3 (number wrongly marked). Watch white I show how it should be done 
and then you can practice at it yourself for one minute. 

Instructions for Addition : 

You will be given a sheet containing columns of one-place numbers. 
Place it before you bottom-side up. When I say "Go," turn the sheet 
over and begin adding. Write the sum of each column of ten figures 
under the line at the bottom of that column. Add as many columns as 
you can in ten minutes without making errors. If an answer is wrong 
you will receive no credit for that column. When you finish the examples 
on one sheet take another. Watch while I show you how it is done and 
then you can practice it yourself for five minutes. 

Instructions for Copying Addresses: 
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on your sheet, find the New York City address and write it after that 
name on your sheet See how many of these addresses you can correctly 
copy on your sheet in ten minutes. Do not begin until I say "Go," and 
cease immediately when I say "Stop." Watch while I show you how it 
should be done. 

Instructions for Handwhiting: 

There will be placed before you face downward a printed paragraph 
which you are to copy as much of as you can in four minutes. You will 
be scored for both quality and speed, so write as fast as you can while 
writing the best that you can. Be sure to punctuate and capitalize just 



Teachers College, Columbia University, publishes the Visual Vocabu- 
lary, Reading, and Completion tests. Further information concerning 
the other tests may be had by communicating with the author. 
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Cancelling A : Original scores made in i minute by 88 children. 

1/4 2/5 1/8 2/» MO 9/u va vn V« l/is V 1 * */« Vis */u 



q as si 



7) M 84 60 » 



40 48 46 



8 S3 H 48 74 74 
60 S3 58 88 74 74 



-28 82 38 



28 SO 10 38 



88 74 74 74 70 



78 00 48 88 
54 GO 88 74 

88 70 «2 98 



88 74 78 78 48 44 20 



80 00 84 88 



71 88 H 88 74 



80 54 78 74 



42 82 88 88 71 



If 80 38 



04 B2 10O M 100 

- 78 88 80 88 



N 60 72 



74 80 78 78 88 

62 08 80 84 00 



SO 34 28 48 84 

48 SO 88 86 48 

44 — a 48 88 

808462844882600244 

566468 70 74B49888»e*2 — 

6862 74 68 71 61 «8 - — — - 

88 82 72 74 70 76 70 74 74 76 74 

50 M 60 72 72 76 73 71 88 SB M 

842888 46 428256 — 40 6680 

40526268 70 78 48648888 — 



32 86 40 

a) io Kl 88 38 88 
22 38 40 48 82 8! 



M 48 SB 80 72 



84 38 34 88 48 



76 08 84 00 26 100 



100 102 118 



888844448460 76 H 

36 73 63 60 74 74 — ~ 
14 40 42 80 86 88 62 



80 82 U 88 71 70 

608880828448848868 
6882 — $204669492*8 



38 38 28 88 



60 100 84 



84 66 62 64 



46 88 62 82 86 72 74 68 74 82 64 

SO 48 46 48 (2 56 46 60 82 



83 88 68 H 80 

84 88 38 82 84 



18 20 40 



18 26 18 



SO 42 68 86 48 46 48 48 



48 80 60 SO 72 

Bl 82 74 74 74 

48 SO 40 M 47 

82 12 38 48 88 _. 

W 48 44 46 SO 48 



78 70 94 68 



66 88 61 68 68 74 70 

48 48 84 ~ - - " 



62 82 82 72 66 



88 60 70 72 



88 84 52 (4 



41 84 43 50 



.. 61 64 68 72 76 
SO 68 68 52 68 78 70 
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Cancelling A (cmtinutd) 

Ind. 1/* 3/iS 1/8 2/9 t/10 3/11 1/1S Vie V" 3/18 4/14 4/16 4/1* 4/1S 

7! 83 S» 48 71 74 90 94 92 94 90 90 100 1W 103 



40 48 42 SI SB (4 
M M S3 40 48 42 

~ si "to oi 

70 M 93 94 91 98 H 



— H -47 34 it SO 48 

OS 70 03 04 03 00 

9S H 88 78 H 100 

70 M 00 W S3 74 



81 40 BO 40 40 34 48 

84 304S41M438SHSOCBCS70SOS383 

SS SOBS — ~ — — — — — — — 43 — M 

W 4843M»M03E800HMT4MB473 

87 33444OtlS000MSB5171S3SO4O — 

88 24 38 38 30 40 BO 48 38 48 SO 60 40 M 58 
St (0643040510860 — — — — — — — 

tt 4003t4 70 08780884 74 84ni00 100 104 

H 84 — 38 61 SO SO 08 08 00 83 76 M ~ — 

13 38 54 SO 41 44 SB 48 46 46 64 63 S3 48 ~ 

tt 40 04 SO 04 07 «4 04 74 74 — 74 70 84 S3 

Cancelling S : Original scores made in i minute by SS children. 

Iod. l/H 3/1S 1/10 3/17 3/18 4/14 4/16 4/10 4/10 
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1/11 1/1S 1/U 3/17 1/18 4/14 



— — — w 



Cancelling 2: Original s< 



a made in one minute by SS children. 
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Cancellling 3 {continued) 
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Cancelling 3 (continued) 
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Addition : Original scores nude in id minutes by 88 children. 

Ind. 3/4 2/S 5/8 2/1 2/10 1/11 2/18 IAS 2/17 VIS 
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Addition (continued) 



Copying Addresses: Original s 



s made in 10 minutes by 88 chil- 



i/» i/jj j/M im tm a/i a/i 1/3 
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in* J/33 s/a* a/as a/u s/i 1/1 »/J a/* 



4/so 4/» 4/aa t/n i/M 4/w 
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Copying Addresses (continued) 
Ind. 1/14 vis vi« m/i vso 4/11 
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Handwriting : Original scores nude in 4 minutes by 88 children. 
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Hand whiting (continued) 
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